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ABSTRACT 

We attempt to measure possible miscalibration of the wavelength scale of the VLT-UVES spectro- 
graph. We take spectra of QSO HE0515-4414 through the UVES iodine cell which contains thousands 
of well-calibrated iodine lines and compare these lines to the wavelength scale from the standard 
thorium-argon pipeline calibration. Analyzing three exposures of this z = 1.71 QSO, we find two 
distinct types of calibration shifts needed to correct the Th/Ar wavelength scale. First, there is an 
overall average velocity shift of between 100 ms"-'^ and 500 ms^^ depending upon the exposure. Sec- 
ond, within a given exposure, we find intra-order velocity distortions of 100 ms~^ up to more than 200 
ms~^. These calibration errors are similar to, but smaller than, those found earlier in the Keck HIRES 
spectrometer. We discuss the possible origins of these two types of miscalibration. We also explore 
the implications of these calibration errors on the systematic error in measurements of ^ , the change 
in the fine-structure constant derived from measurement of the relative rcdshifts of absorption lines in 
QSO absorption systems. The overall average, exposure-dependent shifts should be less relevant for 
fine-structure work, but the intra-order shifts have the potential to affect these results. Using cither 
our measured calibration offsets or a Gaussian model with sigma of around 90 ms~^, Monte Carlo 



mock experiments find errors in — of between 1 x IQ-^N^yl'"^ and 3 x lO-^N^yi'\ where N^y^ is the 
number of systems used and the range is due to dependence on how many metallic absorption lines 
in each system are compared. 
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1. INTRODUCTION 

In a recent paper (Griest et al. 2010) it was shown that 
Keck HIRES spectrograph measurements of absorption 
features in high redshift QSOs calibrated with the nor- 
mal thorium-argon (Th/Ar) technique contained overall 
average velocity shifts between exposures of up to 2000 
ms~^ over several nights and calibration errors of up to 
500 ms~^ even within a single echelle order in a single 
exposure. Similar wavelength calibration problems were 
found previously by Osterbrock et al. (2000) and Suzuki 
et al. (2003). These results were found by re-calibrating 
the wavelength scale using spectra taken through the 
Keck HIRES iodine cell. The iodine cell, which has been 
used extensively in the Doppler method search for extra- 
solar planets, puts several sharp lines per Angstrom on 
top of the QSO spectrum, and its reproducible absorp- 
tion spectrum allows extrasolar planet workers to attain 
a relative velocity precision of around 5 ms~^ between 
exposures taken up to several years apart (Johnson et 
al. 2006; Konacki 2009). The analysis in Griest et al. 
(2010) compared the iodine cell wavelength calibration to 
Th/Ar calibration with two separate Keck HIRES anal- 
ysis pipelines (XIDL and MAKEE) and so is probably 
measuring calibration problems with the Th/Ar calibra- 
tion of the spectrograph itself rather than problems just 
with the standard calibration software. 

For most astronomical work, wavelength calibration er- 
rors of around 1 kms~^ are not important, but over the 
past few years several groups have used absorption lines 



in the spectra of high redshift QSOs to measure or place 
limits on possible changes in the fine-structure constant 
over cosmological time. There has been considerable con- 
troversy in experimental measurements of — using high 
redshift absorption systems, with claims of detection and 
also limits on variation inconsistent with those claims. 
See Murphy et al. (2008) for a summary. 

If the fine-structure constant was different 10 billion 
years ago by the claimed detection of ^ = (— 5.7 ± 
1.1) X 10"^ (Murphy et al. 2003, 2004), relative atomic 
transition wavelengths are expected to differ from their 
lab values by up to ^200 ms~^. For such experimental 
uses, it is important to pay careful attention to calibra- 
tion errors of the size reported above. However, it is 
important to understand that while individual measure- 
ment errors may be larger than the expected signal, if 
many atomic transitions are used in many QSO absorp- 
tion systems, it may be possible to average away these 
calibration uncertainties. See, for example. Murphy et 
al. (2009). 

The results of Griest et al. (2010) apply only to the 
Keck HIRES instrument, but there is another instru- 
ment, the VLT UVES spectrograph, that is playing a 
key role in the search for possible changes in the fine- 
structure constant using absorption lines in high redshift 
QSOs. In this paper, we perform a similar recalibra- 
tion of the standard UVES Th/Ar wavelength calibra- 
tion pipeline using the VLT iodine cell. Wc find similar, 
but smaller, wavelength calibration errors than found in 



HIRES. We discuss the possible origin of these offsets, 
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Table 1 

Journal of Observations 



Exposure and Date 


Time (UT) 


Th/Ar Time (UT) 


1 2003 Oct 11 


07:32 


10:56 


2 2003 Oct 11 


08:13 


10:56 


3 2003 Oct 13 


05:31 


07:06 



portant systematic errors in the measurements to date of 

Aa 
a ' 



2. OBSERVATIONS 

Six exposures were taken of the quasar HE0515-4414 
{z = 1.71, V w 14.9 mag) with the VLT-UVES spectro- 
graph in 2003 October. In this paper, we analyze the 
wavelength cahbration of the three exposures that were 
taken with the iodine cell in place. We include a journal 
of these observations in Table [T] Over the wavelength 
range of interest, the median signal/noise of the spectra 
extracted from these exposures is around 20 pixel" ^ for 
the upper "u" chip and around 11 pixel" ^ for the lower 
"1" chip. One pixel corresponds to about 1.5 kms~^ at 
the leading edge of each echelle order and around 0.9 
kms~^ at the trailing edge. We note that ESO's spec- 
ifications for UVES are that the gratings, after being 
moved, be returned to the same position to within a toler- 
ance corresponding to 0.1 pixels (D'Odorico et al. 2000). 
Thus naively we might expect to see an overall non-zero 
velocity calibration shift between the iodine and Th/Ar 
lines of roughly 140 ms^^ at 5500 A. The six QSO ex- 
posures were taken during two nights, with the first two 
I2 QSO exposures being taken on the first night and the 
third I2 exposure taken on the next. The first two ex- 
posures were calibrated with the same Th/Ar exposure; 
however we note that the gratings were moved after the 
two data exposures and then moved back to the same 
position in order to take the Th/Ar exposure. The third 
I2 QSO exposure was followed first by a non-l2 QSO ex- 
posure (same QSO, grating setting, etc.) and then by the 
Th/Ar calibration exposure. We had hoped this schedul- 
ing would remove a possible source of error caused by 
grating movement, but in fact, the third QSO I2 expo- 
sure was part of a different "observation block" , mean- 
ing the gratings were reset (moved and then returned) 
between the data and Th/Ar exposures. We therefore 
expect overall velocity shifts between the I2 and Th/Ar 
wavelength scales of order 140 ms~^ for all three of our 
QSO exposures through the I2 cell. 

We can also estimate an overall average velocity shift 
having to do with the position of the QSO in the spectro- 
graph slit. The slit width is 0.7" and our exposures were 
taken with seeing between 0.65" and 0.85". This slit- 
width projects onto the CCD with an FWHM of about 
4.8 kms"'' (i? ~ 62,000). Therefore, if a given expo- 
sure has, for example, a 0.1" positioning error, we might 
expect a roughly 600 ms~^ overall calibration shift, sub- 
stantially larger than the error caused by the resetting of 
the spectrometer grating. 

In more detail, we note that each 2400 s I2 exposure 
used only the red arm of UVES in the standard 600 nm 
central wavelength setting for I2 observations. The red 
arm of UVES has a detector containing two CCD detec- 
tors covering the wavelength ranges 496-597 and 599- 



707 nm. No on-chip binning was used; the pixels have a 
width of « 1.3kms~^, providing ss3.7 pixels per FWHM 
resolution element. A circular baffle, or "pupil stop", is 
used routinely in UVES to provide a beam from calibra- 
tion lamps (e.g., Th/Ar, fiat field) similar in size to that 
from the telescope from an astronomical point source. 
Our I2 exposures were taken with an under-sized pupil 
stop, i.e., a slightly smaller beam than usual was allowed 
into UVES, but our Th/Ar exposures used a slightly 
over-sized pupil stop. While these are the default set- 
tings for I2 observations, strictly speaking our aim here 
is to treat the I2 and Th/Ar exposures as similarly as pos- 
sible so that any wavelength shifts between the two are 
appropriate to normal QSO observations where a slightly 
"oversize" pupil stop is used for both object and calibra- 
tion exposures. However, we would not expect slight, 
circular vignetting/truncation of the beam to affect our 
results here. Indeed, subsequent iodine-cell tests with 
UVES have shown any effect from the under-sized pupil 
stop to be very small, if present at all; these results will 
be presented in a forthcoming paper. 

3. DATA REDUCTION AND ANALYSIS 

The QSO flux was extracted using the standard 
pipeline recipes. Five bias and flat field exposures were 
median filtered to produce master bias and fiat field cor- 
rections. The echelle order positions and overall spec- 
trograph setup were derived from short-and-narrow slit 
exposures of quartz and Th/Ar lamps which were used to 
refine a physical model for the expected fiux distribution 
within each order. In our exposures, the QSO flux had 
high enough signal/noise in each echelle order to allow 
the flux distribution itself to define its spatial profile for 
use as object weights in the subsequent optimal extrac- 
tion. No redispersion of the spectra was performed after 
optimal extraction; each extracted echelle order retained 
its original wavelength dispersion as a function of pixel 
position. Rather than co-adding extracted QSO spectra, 
we treated each order of each exposure separately so that 
each I2 (and accompanying Th/Ar) exposure gives a sep- 
arate measurement of the Th/Ar wavelength calibration 
shifts for each order. 

The UVES Common Pipeline Language (CPL) soft- 
ware package includes considerable improvements to the 
wavelength calibration process compared to previous 
UVES reduction pipelines. The Th/Ar line list is only 
a small subset of all known Th/Ar lines in the relevant 
wavelength range and was derived via an objective line 
selection algorithm detailed in Murphy et al. (2007). An 
even smaller subset is used in the final calibration, after 
the CPL pipeline removes those appearing too weak or 
strong/saturated in the individual Th/Ar spectra. The 
Th/Ar exposures were bias- and (normalized) fiat field 
corrected during the extraction. The extractions used 
the spatial weighting profile derived from the optimal 
extraction of the QSO fiux from the corresponding I2 ex- 
posure. This ensures that the same pixels, with the same 
weighting, contribute to both the QSO and Th/Ar spec- 
trum and naturally avoid calibration errors from small 
tilts of the Th/Ar lines with respect to the CCD grid. 
A crude blaze correction was made using the fiat field 
spectral shape in each echelle order. To determine their 
centroid in the wavelength calibration process, the CPL 
pipeline fits Gaussian functions to the selected Th/Ar 
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emission lines. The Gaussian model includes an under- 
lying linear (rather than just constant offset) continuum 
level; this reduces centroiding errors induced by other 
nearby emission lines or other sources of background 
slopes, e.g., residual blaze function (Murphy et al. 2007). 

The wavelength calibration residuals around the de- 
fault two-dimensional fourth-order polynomial wave- 
length solution had an rms of ~ 30-40 ms~^, similar 
to that found from the unbinned Th/Ar exposures by 
Murphy et al. (2007). We also performed wavelength 
calibration using higher order polynomials and we study 
the effect of this in Section 4.1. 

To perform the wavelength recalibration using the io- 
dine lines, we used a method similar to that used in Gri- 
est et al. (2010). A well measured iodine cell absorption 
spectrum taken elsewhere is convolved with a Gaussian 
to give it the same resolution as the iodine lines in our 
QSO spectra. It is then multiplied by an overall normal- 
ization and shifted in wavelength by an amount that gives 
the smallest possible in the difference between the 
convolved iodine spectrum and the QSO spectra. Care 
is taken in the continua fitting as discussed in Griest et al. 
(2010), since the iodine lines cover nearly the entire spec- 
trum. For the reference iodine spectra we tried both the 
Marcy and Butler (Butler, et al., 1996; Marcy 2008, un- 
published) Fourier Transform Spectrograph (FTS) spec- 
trum of the Keck HIRES iodine cell done at KPNO with 
a resolving power, R = 170, 000 and a signal/noise of 700 
pixel" ^, and the UVES iodine cell calibration spectrum|3 
performed at 70°C, with a spectral resolution 0.020 cm~^ 
(which implies R > 1,000,000 throughout the effective 
I2 wavelength range). Besides the difference in resolution 
and a single shift in the absolute scale, the resulting cal- 
ibration shifts were fairly similar using the two different 
reference spectra. For UVES iodine cell data, it is clearly 
more appropriate to use the UVES FTS iodine cell spec- 
trum so we will only present results obtained using this 
spectrum. 

4. RESULTS 

The result of our analysis is a wavelength-dependent 
velocity shift Ushift(A) between the iodine cell value, 
which is presumed to be correct, and the Th/Ar value 
output by the UVES pipeline software, 

Al(A) = AThAr(A) + Ushift(A), (1) 

where there is one such velocity shift function for each 
echelle order of each exposure. We use the iodine cell 
calibration as the standard since it has a much higher 
density of lines and because the QSO and the iodine 
light follow the same optical path, have the same in- 
strument illumination, and arc simultaneous. Additional 
evidence in support of the superiority of the iodine cell 
wavelength scale and discussion of whether Wghift arises 
primarily from the extraction software or from optical 
distortion inside the spectrograph are given below. In 
fact, it is not important to our work that the FTS io- 
dine spectrum be absolutely correct, since we only care 
about relative shifts across an individual exposure, and 
perhaps shifts over time. And since the extraction of 
the Th/Ar spectra was performed using the same object 

^ http: / / www.cso.org/ sci / facilities / paranal/instrumcnts / uvcs / tools 
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Figure 1. Shift in ms-^ needed to bring the Th/Ar VLT UVES 
calibration in line with the iodine spectrum. This figure shows 
results for the three exposures orders of the UVES upper "u" CCD 
chip, with blue and red colors alternating for each echcUe order. 
The black dots show the weighted average of the shifts in each 
order, with error bars showing the average la fit error for each 
order. 

weights as that of the QSO flux, the I2 and Th/Ar spec- 
tra are treated as similarly as possible in the reduction 
process, meaning that any wavelength distortions derived 
should be driven by optical distortions rather than any 
deficiencies of the reduction pipeline. 

The fit for the Wghift calibration offset is affected by the 
size of the wavelength bin used in the comparison, with a 
larger bin giving a smaller formal fit error but less resolu- 
tion on the wavelength scale over which calibration errors 
occur. As a compromise we use a bin of 350 kms~^ or 
GA at 5500A. We translate these wavelength calibration 
shifts into velocity using Vshut = cAA/A and display the 
result for each exposure in Figures [T] and [21 

Considering each order from each iodine exposure sep- 
arately, we began by continuum fitting the exposure and 
masking the data that fall under strong quasar absorp- 
tion lines and data near the edges of the order where 
signal/noise dropped to < 8 pixel^^. Considering the re- 
maining data as one large wavelength bin, we performed 
a simultaneous fit for three variables: the overall nor- 
malization multiplication factor to the continuum, the 
Gaussian convolution kernel, and the overall wavelength 
shift. The wavelength offset found this way is in some 
sense the average offset for the entire order. We then held 
the Gaussian convolution sigma and normalization factor 
fixed for the whole order and fit for the wavelength cal- 
ibration shifts in the smaller wavelength bins across the 
order. Wc fit using a bin size of 350 km s~^ and report the 
value of the bin at the average wavelength value within 
the bin. The bins overlap which means the wavelength 
calibration has effectively gone through a smoothing fil- 
ter of 350 kms~^. 

One sees from Figures [1] and [2] that there arc two dis- 
tinct types of velocity shifts. First, there is an exposure- 
dependent overall average calibration shift which varies 
from less than 100 ms~^ in exposure 2, to around 500 
ms~^ in exposure 3. Second, there are intra-order veloc- 
ity shifts around the overall average shift ranging in size 
from around 100 ms^^ up to around 400 ms"^ within 
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Figure 2. Same as the previous figure, but for the lower "1" 
chip. 
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each exposure and within each echelle order. Both types 
of shifts were also seen in the Keck HIRES iodine cell 
data (Griest et al. 2010), but with considerably larger 
amplitude. Exposures 1 and 2 have an overall average 
shift of around 100 ms~^. while exposure 3, taken on 
the next day is off by about 500 ms~^. It is important 
to note that the overall average velocity shifts between 
exposures do not affect fine-structure constant work sig- 
nificantly; velocity shifts which vary with wavelength - 
i.e., effects which would shift one absorption feature with 
respect to another at a different wavelength - are most 
important (e.g., Webb et al. 1999; Murphy et al. 2009). 
Thus, we are most interested in the relative intra-order 
shifts within each exposure of typically 100 ms~^ and 
occasionally as high as 500 ms~^. The cause of these 
intra-order shifts is not completely understood, but some 
possibilities will be discussed below. 

Another way of visualizing the velocity shifts in Figures 
[T] and [2] is to plot the shift distributions. These are shown 
in Figure[3]which are just histograms of the shifts for each 
of the exposures. The shift from each fine wavelength bin 
is given equal weight in the figure, which, if normalized, 
can therefore be interpreted as a sort of probability of 
finding a shift of that value. We use this histogram in 
this way below when we investigate the effect these shifts 
might have on measurements of — . Of course, the shift 
values for nearby pixels are strongly correlated due to our 
smoothing, but this is not important for how we use these 
distributions below. We give the values of the means 
and standard deviations for these histograms in Table [2l 
Again, it is only the width (sigma) of each histogram 
that is relevant for fine-structure constant work, and not 
the overall average shift which is the histogram mean. 

As discussed above, the exposure-dependent, overall 
average velocity shifts are of the magnitude one may ex- 
pect from errors in the position of the QSO in the spec- 
trograph slit. These types of shifts could also be caused 
by grating shifts, temperature/pressure drifts, etc. In 
preliminary work using the UVES iodine cell on expo- 
sures toward bright stars, we do find that varying the 
position of the star within the slit can give shifts between 
Th/Ar and iodine calibrations of around 1500 ms~^. 
making this a likely contributor. 
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Figure 3. Histograms of shifts from the three I2 exposures. Ex- 
posure 1 is plotted in blue (in the middle), exposure 2 in green (on 
the left) and exposure 3 in red (on the right). 



Next, we would like to explore the possibility that there 
is a relatively constant intra-order distortion that repeats 
for each order and is constant between exposures. If the 
intra-order distortion pattern for a given object exposure 
can be well approximated from a subsequent I2 exposure, 
either of the QSO or of a nearby bright star, then one can 
correct the Th/Ar wavelength scale of the former, i.e., 
a sort of calibration transfer function can be established 
and applied. We are interested in the effect of wavelength 
calibration on — , 



Aa/a (X Avij/c, 



(2) 



where Avij is the velocity difference between two lines, i 
and J within the one spectrum. 

Examination of Figures [T] and [3] might give one the 
impression that there is a pattern of intra-order veloc- 
ity offsets that repeats in each echelle order. To test 
this hypothesis, we overlaid all the echelle orders of each 
exposure on their common CCD pixel scale and then av- 
eraged them in bins of 100 pixels. We plot the mean 
and standard deviation in each bin in Figures I3^a)~(c) 
and[5ja)-(c). The dashed lines show the average over all 
bins. These figures thus provide the average intra-order 
distortion for a given exposure. We do this separately for 
each exposure since the distributions of shifts shown in 
Figure[3]show large variation between the exposures. We 
also treat the UVES upper, "u," chip orders separately 
from the UVES lower, "1," chip orders. Thus, we have a 
total of six figures. 

Examination of these figures shows a weak pattern, es- 
pecially for the "u" chip. We see a roughly linear rise 
in Vshih of around 200 ms~^ starting around pixel num- 
ber 1000 and ending around pixel 2900, where there is a 
sharp drop of over 100 ms~^. The "1" chip figures show 
smaller overall variance and while exposures 1 and 2 show 
some rise over the first half of the pixels, the amplitude 
is smaller than for the "u" chip. Exposure 3 of the "1" 
chip does not seem to have any pattern of deviation. Ex- 
amination of the standard deviations listed in Table [5] 
confirms these impressions, especially that most of the 
variance comes from the "u" chip. This is also clear in 
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Figures [T] and [5] where the larger "u" chip variances are 
evident. 

As stated above, the purpose of looking for patterns 
is to find a correction that could be applied to all expo- 
sures. Modeling the "u" chip deviation as a linear rise 
of 200 ms^^ over pixels between 1000 and 2900, we can 
subtract this line from each of the exposures and recal- 
culate the means and standard deviations. The results 
of this "correction" are displayed in Table [2] We only 
expect this to decrease the standard deviations for the 
"u" chip, but show the "1" chip results also for complete- 
ness. We find a small decrease in standard deviations, 
especially for the weighted and sigma-clipped means and 
standard deviations. For example, the variance in expo- 
sure 1 "u" chip goes from 87 ms~^ to 73 ms~^; exposure 
2 and exposure 3 also drop their sigma values by around 
10 ms~^. Wc had hoped for more improvement, but with 
only three iodine exposures, we view this work as illus- 
trative rather than conclusive. More data are needed in 
order to further explore the size and constancy of this 
type of correction. 

4.1. Effect of degree of polynomial in Th/Ar calibration 

We would like to find the source of the intra-order 
wavelength miscalibrations discovered above, especially 
whether they come from the spectrograph and/or tele- 
scope themselves, or from something in the extraction 
and calibration software. In this section we consider the 
extent to which the polynomial fit for the wavelength 
solution contributes to wavelength scale error. Thus, be- 
sides the standard spectrum extraction using a fourth 
degree polynomial fit for the Th/Ar wavelength solution, 
we also extracted and calibrated the spectra using fifth 
and sixth degree polynomials. Figure |6] shows the dif- 
ferences between these Th/Ar solutions vs. wavelength. 
The thick black lines shows fourth degree minus fifth 
degree, thin blue lines show fourth minus sixth degree, 
and dashed red lines show fifth minus sixth. The upper 
panel shows the upper CCD "u" chip, and the lower panel 
shows the "1" chip. We see that there is substantial dis- 
agreement between these wavelength solutions especially 
for the upper CCD and near the edges of the echelle or- 
ders. For the upper chip, disagreements of 30-40 ms"^ 
are typical with 100 ms~^ not uncommon. For the lower 
chip, agreement is typically within 10-20 ms~^, with 
outliers mostly near the trailing edge of the orders. Note 
that it is not clear that using a sixth-order polynomial 
does a better job, since the disagreement between fifth 
and sixth-order (dashed red line) can be appreciable. 

Next, we recalibrated these spectra using the iodine 
cell method. Figure [7] shows the velocity shifts needed 
to bring the Th/Ar calibrations in line with the iodine 
spectrum. The black lines arc for the fourth degree poly- 
nomial fit and are the same shifts plotted in Figures [1] 
and [21 while the thinner red lines are for the sixth degree 
polynomial fit. Note that the distortions in the wave- 
length scale from polynomial fitting (differences between 
the red and black lines) are substantially smaller than 
the total intra-order distortions, especially for the lower 
"1" chip where the red lines are mostly on top of the black 
lines. This shows that polynomial fitting errors arc not 
the primary source of the intra-order distortions. 

We also compared the absolute wavelength scales found 
by the iodine method, Ai(A) in Equation [1] above, for the 



600 



400 



200 



> 



-200 



600 



400 



— , — , — , — I — , — , — , — , — I — , — 
exposure 01, 'u' chip 



ilii-i 



1000 2000 3000 

pixel number 



4000 



200 



Si 
CO 

> 



-200 



600 



400 



— , — 1 — 1 — I — 1 — 1 — 1 — 1 — I — 1 — 1 — 1 — 1 — |- 
exposure 02, 'u' chip 




1000 2000 

pixel number 



3000 



4000 



200 



Si 

> 



-200 




exposure 03, 'u' chip 

1 1 I 1 1 1 1 I 1 



1000 2000 

pixel number 



3000 



4000 



Figure 4. Average over echelle orders of calibration shifts for 
the "u" (upper) chip as a function of pixel number. The echelle 
average calibration shift in ms~^ at each position bin is plotted on 
the ordinate. The standard deviation in the bin of each calibration 
shift is plotted as the error bar. The dashed line shows the average 
over all bins. 



fourth, fifth, and sixth degree polynomial fits. These 
agreed with each other exceedingly well, with RMS dif- 
ferences between the fourth and sixth degree fits of 1 
ms-i or less for the lower "1" CCD chip, and RMS dif- 
ferences of 5 ms~^ or less for the upper "u" chip. This 
is a nice confirmation of the robustness of our iodine fit- 
ting method, since this high level of agreement obtained 
even very near the edges of the echelle orders where the 
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Table 2 

Moans'* and Standard Deviations^ of Calibration Shifts 



Exposure 


Chip 


Unweighted 


Unweighted Corrected*^ 


Weighted and Chpped'^ 


Weighted and Chpped Corrected'"^ 


1 


1 


118 ± 76 


115 ± 73 


133 ± 58 


123 ± 52 


2 


1 


45 ± 86 


42 ± 87 


63 ± 69 


53 ±70 


3 


1 


487 ± 75 


483 ± 85 


485 ±60 


475 ±67 


1 


u 


116 ± 138 


116 ±129 


107 ± 87 


102 ± 73 


2 


u 


± 137 


± 130 


-5 ± 87 


-9 ± 70 


3 


u 


477 ± 174 


475 ±172 


499 ± 115 


495 ± 105 



^ All numbers arc in ni 

^ Each pixel has a correction added before calculation. See the text. 

^ The mean is calculated using the error bars as weights and then recalculated after throwing out points more than 3fT from the mean 



different polynomial fits disagreed greatly. 

Thus, we have several interesting conclusions. First, 
we see from Figure |6] that the polynomial extraction 
algorithm can make a significant contribution to wave- 
length miscalibration, especially in the upper "u" CCD 
and most especially near the edges of the echelle orders, 
where velocity shifts of order 70 ms~^ are common. Sec- 
ond, there is little evidence that a sixth-order polynomial 
does a better job than a fourth-order polynomial, since 
in both cases the robust iodine cell recalibration shows 
similar size miscalibrations. Thus, we caution against the 
use of lines near the edges of the echelle orders calibrated 
only using Th/Ar. Third, the overall contribution to the 
miscalibration variance of the polynomial fitting is small 
compared to other contributions. Thus, better treatment 
of wavelength fitting in spectra extraction will not cure 
the main source of intra-order wavelength miscalibration 
we are finding. 

4.2. Possible causes for the intra-order distortions 

While more work is clearly needed for understanding 
the ultimate cause(s) of the velocity shifts (both the over- 
all shifts between exposures and the intra-order distor- 
tions within individual exposures) , this will probably re- 
quire a larger data set with higher spectral signal/noise 
and/or spanning a range in several parameters such as 
time and telescope pointing. Nevertheless, we briefiy dis- 
cuss some possible causes for the intra-order distortions 
between the iodine and Th/Ar calibrations. 

First, for Keck/HIRES, Griest et al. (2010) cited a 
known misalignment between the Keck telescope optical 
axis and that of HIRES (Suzuki et al. 2003) as a probable 
cause for the regular shape of the intra-order distortions 
from order to order. The misalignment causes differen- 
tial vignetting between the telescope beam and that of 
HIRES (and, therefore, the Th/Ar beam). While we are 
unaware of a similar misalignment in UVES, it remains 
possible that similar differential vignetting is responsi- 
ble for the intra-order distortions found here. In general, 
and despite the use of the simple pupil stop in UVES, 
the science object and Th/Ar calibration light do not 
follow the same optical path, and so we regard a subtle 
differential effect of similar, or even subtler nature to be 
responsible. 

However, other simpler effects may also be suspected 
at first. The very similar shape of the intra-order dis- 
tortions across many orders in HIRES, and the weaker 
but broadly similar tendency revealed in Figure U] and, 
to a lesser extent, Figure [5] for UVES, brings to mind the 
blaze function of echelle spectrographs. For example, one 
may worry that the centroiding of Th/Ar emission lines 



is systematically shifted in opposite directions either side 
of the order center, thereby producing an intra-order dis- 
tortion which, broadly speaking, was similar across all 
orders. However, this possibility can be ruled out: the 
slope produced by the blaze is too small, as shown by the 
following simple considerations. In selecting Th/Ar lines 
best used for calibrating UVES, Murphy et al. (2007) 
measured the velocity shift between two Gaussian fits to 
each line, one with a constant offset continuum, the other 
with a sloping continuum. It was found that a back- 
ground slope of 0.01 (kms^^)^^ in relative line intensity 
resulted in a ^ 160 ms~^ shift in the line centroid if only 
a constant offset continuum was fitted. If the blaze func- 
tion of UVES is not normalized out of the Th/Ar spec- 
tra, then the background intensity will drop by a factor 
of 2 over half the free spectral range (FSR) either side of 
the center of each order. UVES's FSR is - SOOOkms'^ 
Thus, the average slope of the inner half of the blaze func- 
tion is - ±0.5/(0.5 X 3000kms-i) = 3 x 10"" (kms"!)"! 
or, since it is curved, the maximum slope will certainly 
be < 1 X 10^"^. This would translate to a line shift of only 

160 ms~^ X 1 X lO^'^/O.Ol — 16ms~^. This upper limit is 
an order of magnitude below the intra-order distortions 
we observe. 

The above blaze effect will not appear in our UVES 
Th/Ar calibration because the Gaussian fits to the 
Th/Ar lines had a sloped continuum. Also, our Th/Ar 
extractions involve a crude blaze correction; even if this 
did not reliably remove all blaze effects, the resulting 
line-shifts would be smaller than the above calculation 
indicates. For example, the very broad spectral shape 
of the quartz lamp which provides the flat field light 
will remain in our CPL-reduced Th/Ar spectra (and the 
I2 spectra), but the above argument demonstrates that 
such a broad background spectral shape will have negligi- 
ble effects and cannot explain the intra-order distortions 
we find. Similar blaze effects probably play little role 
in the HIRES results of Griest et al. (2010) because the 
data reduction pipeline which provided their main results 
(HIRES REDUX by Jason X. Prochaska) uses a sloped 
continuum. They also used another pipeline (MAKEE 
by Tom Barlow) that used a constant offset continuum, 
but found similar results. That is, any effect from the 
blaze function in the Griest et al. results will be small 
by comparison to the intra-order distortions measured 
there. 

Recently, Wilken et al. (2010) compared the wave- 
length scales established from a laser frequency comb 
(LFC) and a standard Th/Ar lamp in one echelle or- 
der of the HARPS spectrograph on the ESO 3.6 m tele- 
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Figure 5. Same as last figure, but for tlie lower "1" chip. 

scope. The LFC revealed periodic artifacts in the phys- 
ical pixel size and/or sensitivity from the CCD manu- 
facturing process. By appropriately correcting the sen- 
sitivity of every 512th pixel, the wavelength calibration 
residuals were substantially improved. They also found 
distortions between this LFC calibration and that from 
the Th/Ar lamp. Perhaps the CCD manufacturing de- 
fects cause this distortion and contribute to the intra- 
order distortions we find for UVES. This seems unlikely 
because the Th/Ar lines are distributed randomly and 
very sparsely across the CCD grid, so very few (if any) 
Th/Ar lines straddle a pixel with a manufacturing defect. 
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Figure 6. Differences in Tli/Ar wavclengtli solutions for orders 
occurring on the upper "u" CCD chip (upper panel) and lower "1" 
CCD chip (lower panel). The thick solid black lines show the differ- 
ence in wavelength scale between extractions done with a fourth- 
order polynomial and a fifth-order polynomial. The thin solid blue 
lines show the fourth-order minus the sixth-order, and the dashed 
red lines show the fifth-order minus the sixth-order. The beginning 
of each order is shown by a big dot and exposure number is labeled. 
Notice the different scales on the upper and lower chips. 

In fact, the Th/Ar-LFC distortions found by Wilken et 
al. seem similar in shape and magnitude to those we find 
for UVES between different Th/Ar solutions using dif- 
ferent polynomial degrees; compare Figure [6] with their 
Figure 4 (blue line). 

Finally, we note that a similar method for discover- 
ing shifts in the Th/Ar calibration of UVES has been 
demonstrated by Molaro et al. (2008b). They compared 
the solar spectrum reflected from asteroids with a labo- 
ratory FTS solar spectrum to establish the wavelength 
scale of UVES and compare it with the Th/Ar solution. 
First, it is interesting to note that they only find small 
(10-50 ms~^) average velocity shifts for different expo- 
sures, possibly indicating that accurate centroiding of the 
asteroids within the 0.5" wide slit was achieved. How- 
ever, only the line cores of a few solar absorption features 
per echelle order were utilized, so it is difficult to assess 
whether the intra-order distortions we find here are re- 
flected in the results of Molaro et al. (2008b). 
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show results for the fourth degree polynomial fit, while the thin 
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5. EFFECT ON ^ 

a 

As stated in the introduction, wavelength cahbration 
errors of the magnitude presented in this paper are unim- 
portant for most astronomical work. In this section wc 
wish to address the question of whether or not calibra- 
tion errors such as we are finding can make a difference 
in experiments looking for changes in In order to 

proceed we need an estimate of the magnitude of ^ we 
wish to be sensitive to. There are at least two possible 
sources for this number: first, ^ = (— 5.7± 1.1) x 10~^, 
which is the most robust claimed detection of — (Mur- 
phy et al. 2001a, 2001b, 2003, 2004), and second there 
are several claimed limits on — near — < 1 x 10~^ 

a a 

(Chand et al. 2004; Srianand et al. 2004; Levshakov 
et al. 2006, 2007; Molaro et al. 2008a; cf. Murphy et 
al. 2008). Thus, the question we would like to ask is 
to see whether the UVES wavelength calibration errors 
discussed above, can alone give rise to estimates of — 
in the 1 x 10~^ to 5 x 10~^ range. 



Naively, one might expect the calibration errors of 200 
m s~^ to 500 m s~^ we found above to give rise to spurious 
detections of — of order 1 x 10~^. If a was different in 
the past, the velocity shift of an absorption line from the 
lab value due to changing a can be found from 



Aa 



-2cqjX 



(3) 



where j numbers the atomic transitions that are being 
compared, vq is a constant offset (degenerate with the 
system redshift), the Xj are constants that depend only 
on the wavelength Aq of the transition, and the qj char- 
acterize the sensitivity of each transition to a change in 
For transitions of interest in QSO absorption work, 
these 5- values range in magnitude from around 20 cm~^ 
to 2000 cm~^ (Porscv et al. 2007), giving rise to velocity 
shifts of a few meters per second up to around 100 ms~^ 
(for a value of ^ = -5.7 x lO"*^). Thus, for individual 
transitions, the sizes of the wavelength calibration errors 
found above are of the order of or larger than the sizes 
of the signal expected from a changing a. 

However, the value of ^ obtained does not depend 
upon just one transition, but is always a comparison of 
two of more transitions. Thus, an overall velocity shift of 
500 ms^^ as seen in exposure 3 above is not relevant as 
long as comparisons are not done across different expo- 
sures. Figures [T] and [2] show that the relative shift across 
one exposure is substantially smaller than the overall 
shift. In addition, if the calibration shift error is ran- 
dom, and equally likely to be negative as positive, then 
it can be removed by averaging over many transitions 
and many lines of sight. Potential problems do exist if 
the distribution of relative shifts is not random, or if the 
sizes of the errors are large enough that a good measure- 
ment requires too many transitions or too many lines of 
sight. Thus we wish to perform some mock experiments 
using the above distributions of calibration shifts to see 
what the effect on — would be. 

a 

In an actual experiment, a value of is estimated 
by doing a large joint fit of the Voigt line profiles, the 
system redshifts, and the possible velocity offset due to 
In our Monte Carlo mock experiments we try to 

calculate a value of — using the UVES VLT data, but 
without measuring any actual absorption lines. Instead 
of using the fitted system redshifts to find velocity offsets, 
wc use the wavelength calibration offsets given by the 
fine-binned (blue and red) lines in Figures [1] and [21 and 
add these to the lab values of Xq in Equation Q . 

The basic method is as follows. We first choose a 
random redshift in the range z = 0.2 to z = 3.7, and 
calculate the wavelengths of the 23 atomic transitions 
that were studied in Murphy et al. (2003). We define 
-^tran, the uumbcr of these 23 transitions that fall at 
wavelengths for which we have iodine wavelength cali- 
bration. We require at least Nj^in transitions, and show 
our results as a function of this -/Vmin (a typical value is 
-^min = 4, and wc do not find any cases with iVtran > 9). 
For each such transition wc shift its wavelength by an 
amount given by the finc-binncd (blue or red) line from 
one of the exposures in Figures [1] and [2l We then perform 
a fit of Equation ^ for ^ and its error. This counts 
as one absorption system, and we repeat this procedure 
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A'sys times, averaging the values of ^ obtained. We con- 
sider values of TVgys ranging from Ngys = 143, the number 
of systems used in Murphy et al. (2004), to A^sys = 1, 
the value when only one system in one QSO is being ana- 
lyzed. The above procedure constitutes one Monte Carlo 
experiment. We repeat the experiment many times to 
find an average value of — and its standard deviation 
(measured by the variance of ^ for the many experi- 
ments). 

We present resulting average values of ^ along with 
their standard deviations as a function of Ngys and A'min 
in Table |3] for 200,000 mock experiments. The table 
shows results for Ngys = 143 and Ngys — 1. Note one 
expects the standard deviation in — to simply scale as 

Nsys , which is close to what we find in Table [H Thus, 
we will use this scaling from now on and only report 
Monte Carlo experiments for Ngys — 1. 

Besides using the actual wavelength calibration errors 
above, we also ran several Monte Carlo simulations using 
two simple models of the calibration offsets. The results 
of these simple models are also reported in Table [31 For 
the first model we used a Gaussian random velocity offset 
with a standard deviation equal to 91 ms^^. For the 
second, we modeled the velocity offsets as a sine function 
with amplitude equal to tt/2 times the standard deviation 
of the velocity offsets for one of the exposures above, and 
with a wavelength of about one echelle order. For this 
case, we found the results did not depend strongly on 
the sine wavelength. As seen in Table |3] the results for 
cr(-^) are quite similar for all three exposures and for 
the Gaussian and sine function models. 

Table |3] shows several interesting things. First in all 
cases when iV,„in = 2, both the mean calibration offset 
and standard deviation in are substantially larger 

than expected from a simple l/ViVtran scaling. We 
think this is due to occasional cases where there are 
very few transitions found, but these lie close together 
in Xj = ~2cqj\j. Since is basically the slope in 
Equation ([3]), a small Aa; offset can result in a very large 
slope and therefore a large error in ^ . It takes just a few 
such cases to greatly increase the standard deviation. A 
lesson here may be not to use systems in which very few 
transitions can be compared. For example, in the alkali 
doublet method (e.g., Bahcall et al. 1967, Varshalovich 
et al. 2000, etc.) two transitions that are close together 
in wavelength are compared, so this method would be 
sensitive to intra-order distortions. More generally, even 
for A'niin = 4 and N^^ia = 6, we find the errors dropping 
more quickly with than I/a/ZV. 

Next, we note that the results for all three exposures 
and for the Gaussian and sine function error models are 
quite consistent, especially when one takes into account 
that the standard deviation of velocity offsets for expo- 
sure 1 is slightly smaller than for the other exposures. 
Also as expected, the large overall velocity shift for ex- 
posure 3 had no effect. 

If we restrict ourselves to the A^min = 6 column, and 
consider N^ys — 143, we see that the systematic error 
introduced to a many multiplet measurement of is 
around 0.28 x 10^®, significantly smaller than the statis- 
tical error of 1.16 x 10"^ stated in Murphy et al. (2003, 



2004). We do note that Murphy et al. (2003, 2004) used 
the Keck HIRES spectrograph and not the VLT-UVES 
instrument. 

An important problem with the above results is that 
the value of and its standard deviation depends 
strongly on A^tran, the number of transitions compared 
in each system, and that no cases were found with 
A'tran > 9. This latter fact is because the results above 
only included lines that overlapped with our iodine cell 
coverage. Thus, A^tran found in our Monte Carlo experi- 
ments are artificially lower than in an actual experiment, 
which typically has more spectral coverage. Since the 
value of cr(^) drops quickly with an increase A'^tran, we 
also expect an actual experiment to find smaller devia- 
tions than the ones we report. We find this low value of 
A'tian to be especially true for certain values of z, where 
very few interesting lines fall within our iodine cell cov- 
erage. Our attempt to get around this by setting a mini- 
mum number of transitions, A^min, was partially success- 
ful, but Table [3] shows strong dependence on A"min- Even 
more clearly, we see this in Table |4] where the values of 
sigma of ^ depend on A^tran substantially more strongly 
than l/\/Ntian. We note that when A'tran is specified in 
the table, exactly Ntmn transitions are compared, while 
when A'min is specified all cases that have A'tran > A^min 
are used. 

To remedy this situation, we need to somehow estimate 
the calibration offsets in regions of the spectra where 
we do not have iodine cell coverage. We attempt to do 
this by replicating the calibration offsets from the regions 
where we measure them to all the other spectral regions 
where interesting transitions occur; i.e., we assume that 
the distributions of shifts illustrated in Figures \T\ and 
[2] apply to all wavelengths. We then repeat the Monte 
Carlo experiments above. 

Results of these simulations are shown in Figure [5] and 
in Tabic [5] Now we find few systems with less than 
10 transitions and many with around 18. The values of 
o'(^) for low values N^un are similar to those found in 
the iodine coverage only Monte Carlos, again depending 
very strongly on A^min (or A^tran)- However, for larger 
values of A^min, f (^) stabilizes and approaches the ex- 
pected 1 / V-^tran bchavior as M„in increases. 

The strong dependence of cr(—) on A'^min is interesting 
and suggests that the standard Tore that says it is better 
to use transitions in the same echelle order may not be 
true. The wavelength calibration errors we found above 
exist even within single echelle orders. Thus, it may be 
more robust to measure or limit using transitions 
that are well separated in x; the extra "lever arm" may 
be advantageous in reducing scatter. 

Overall, the large ct's found at small A'tran imply that 
due to wavelength calibration errors it may be dangerous 
to attempt to measure ^ using only a few transitions 
in one or two systems. Until better understanding is 
found as to the source of these calibration errors it is 
probably important to average the errors away by using 
many transitions in many absorption systems. 

We also see from Figure [8] that the mean and stan- 
dard deviation for all three exposures agree well, and 
that the three experimentally determined measurements 
of cr(-^) are well modeled by a Gaussian (or sine func- 
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200,000 Monte Carlo realizations were used with A^s- 



1. 



tion) model with the same velocity standard deviation. 
Thus, it seems that the error in caused by this wave- 
length calibration is completely specified by just the stan- 
dard deviation of the velocity offsets. In order to quantify 
this we ran a suite of simulations using various values of 
(t(u). We found we could fit all the results for cr{^) 
with a fairly simple formula: 



a 



(iV,y,iVtran)5 



(4) 



where a(v) is in ms , and Cm ~ 1 (to within 10%) 
for TVtran > 8, and Cm = 21 for A^tran = 2, Cm = 5.1 
for iVt,an = 3, Cm = 1.7 for TVtran = 4, Cm = 1-3 for 
Nu-an = 5, and Cm = 1-1 for A'^tran = 6 or TVtran = 7. The 
simulations were done with our ansatz for full wavelength 
calibration between 3000A and 10,500A. 

6. CONCLUSIONS 

We tested the accuracy of the wavelength calibration of 
the VLT UVES spectrograph by recalibrating the stan- 
dard Th/Ar calibration pipeline output using spectra 
taken though the UVES iodine cell. We found several 
types of miscalibration: first, an exposure-dependent 
overall average velocity shift that is probably mostly 
due to the position of the QSO within the spectrograph 
slit. Repositioning of the spectrograph gratings and pres- 
sure/temperature drifts inside the spectrograph may also 
contribute to these velocity shifts. These overall average 
shifts range up to 500-600 ms~^, but do not affect fine- 
structure constant measurements as long as comparisons 



between absorption features are only performed within 
the same spectrum. 

Second; and more importantly, we found intra-ordcr 
calibration shifts of up to several hundred meters/second 
that occur within the same spectrum and within each 
echelle order. These can affect measurement of the 
fine-structure constant. We investigated several possible 
causes for these shifts. We explored the effect of the de- 
gree of the polynomial used in the Th/Ar calibration and 
found a velocity dispersion of 20-40 ms^^ due to this. 
This is several times smaller than the total dispersion we 
find in the velocity measurements, so the main effect we 
are finding is not due to the degree of the Th/Ar calibra- 
tion polynomial. In fact, we found that the iodine cell 
recalibration of extractions using different degree polyno- 
mials allowed us to regain the same absolute wavelength 
scale to an accuracy of better than 5 ms~^, thus show- 
ing the robustness of our iodine recalibration method. 
We considered several other possible causes for the intra- 
ordcr shifts and showed that spectrograph blaze function 
and related possibilities were unlikely. Thus, we conclude 
that most likely there are unknown hardware-related sys- 
tematic errors within UVES and/or the VLT responsible 
for these shifts. We noticed some weak patterns in the 
shift as a function of CCD pixel and attempted a cor- 
rection based upon these patterns. This correction was 
not very successful, so we conclude that additional data 
and analysis arc required to discover the cause and po- 
tential fixes for these wavelength scale shifts. We note 
that it is important look for such a correction, since if 
the intra-order distortion pattern for a given object ex- 
posure could be well approximated from a subsequent I2 
exposure, either of the QSO or of a nearby bright star, 
then one could correct the Th/Ar wavelength scale of the 
former, i.e., a sort of calibration transfer function could 
be established and applied. 

We next explored the effect of these shifts on determi- 
nation of or limits on the value of the fine-structure con- 
stant. Using our measured shifts we performed Monte 
Carlo experiments and found that the effect of the intra- 
ordcr shifts is well modeled by a Gaussian dispersion in 
wavelength calibration with a magnitude of around 80- 
120 ms^^. We found that the effect on ^ depended 
strongly on the number of lines being compared within 
the spectrum, with the use of a small number of lines re- 
sulting in larger than expected errors. Thus, until a cor- 
rection to these intra-order wavelength miscalibrations 
is found, we recommend against focusing on compari- 
son between pairs of lines or the use of spectra which 
contain only a few lines. We summarized the results of 
our Monte Carlo experiments in a fitting formula Equa- 
tion Q, which should be useful in estimating the effect 
of these (or similar) wavelength calibration problems on 
future fine-structure work. 
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^ Only transitions within the iodine cell coverage are included from 200,000 realizations of A^sys — 1. 



Table 4 

Standard Deviation^ of from Wavelength Calibration Errors as a Function of the Number of Transitions. 
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Table 5 

Monte Carlo Results^ for Mean and Standard Deviation of 



TV ■ 


Exposure 1: Mean/10~8 


Exposure 1: cr/lO 


Gaussian: Mean/10 " 


Gaussian: cr/lO ^' 


2 


-0.018 


81.2 


0.185 


91.6 


4 


-0.101 


4.99 


-0.036 


5.64 


6 


-0.064 


2.59 


-0.028 


3.04 


8 


-0.051 


2.00 


-0.006 


2.38 


10 


-0.042 


1.71 


-0.005 


2.03 


15 


-0.029 


1.30 


0.004 


1.59 


20 


0.153 


1.05 


0.007 


1..35 



^ All of the 23 transitions falling between 3000A and 10500A are included (sec the text), for 200,000 realizations of Nsys — 1- 



